Hierarchical Learning from Natural Images
نویسندگان
چکیده
In this paper, we apply unsupervised learning methods to construct response functions for V1 simple cells, V1 complex cells, and V2 simple cells from a set of natural images. To support this, we reimplement existing sparse coding methods with the use of commercial optimization software. Introduction The human visual cortex contains a small number of self-contained functional units that fit together in reasonably well-understood pathways. The ventral pathway, which is concerned with object recognition, has four stages: V1, V2, V4, and IT (the inferior temporal lobule). V1 consists of simple cells that resemble localized, oriented Gabor filters and complex cells that respond to identical stimuli independent of phase. V1 outputs information to V2, which contains cells thought to respond to broader image contours. Since the landmark paper of Field & Olshausen (1996), it has been known that linear filters learned as sparse codings on datasets of natural images correspond almost exactly to the receptive fields of V1 simple cells. Hoyer & Hyvärinen (2000), among others, realized that learning an additional layer of nonlinear energies replicated the behavior of V1 complex cells. V1 has received the great majority of research focus in this area because its behavior is well understood and, perhaps, of suspicion that simple information-theoretic elements should not apply to higher levels. However, Hoyer & Hyvärinen (2002) demonstrates that similar sparse-coding techniques may yield the contour activation patterns one would expect in V2. In this paper, we expand on these results in two ways. First, we introduce learned features at every level, feeding forward from the image to V1 and from V1 to V2. This is in contrast to the hand-coded V1
منابع مشابه
Hierarchical k-Means for Unsupervised Learning
In this paper we investigate how to accelerate k-means based unsupervised learning algorithms with hierarchical k-means. We show that hierarchical k-means significantly speeds up k-means based learning approaches in both the training and query phases at minimal cost to test accuracy. This speedup allows for much larger numbers of centroids to be used, which in turn leads to much better learning...
متن کاملMachine Learning of Hierarchical Clustering to Segment 2D and 3D Images
We aim to improve segmentation through the use of machine learning tools during region agglomeration. We propose an active learning approach for performing hierarchical agglomerative segmentation from superpixels. Our method combines multiple features at all scales of the agglomerative process, works for data with an arbitrary number of dimensions, and scales to very large datasets. We advocate...
متن کاملA Hybrid Optimization Algorithm for Learning Deep Models
Deep learning is one of the subsets of machine learning that is widely used in Artificial Intelligence (AI) field such as natural language processing and machine vision. The learning algorithms require optimization in multiple aspects. Generally, model-based inferences need to solve an optimized problem. In deep learning, the most important problem that can be solved by optimization is neural n...
متن کاملCharacterizing imageability in Gajar houses of Tabriz
Aims: From Lufor’s perspective, space is constructed based on spatial operation, recreation and the space in which recreation takes place. This approach is a descriptive view of the relationship between space from a materialistic point of view and its dominant ideas with its dwellers. In this perspective, humans integrate distinct and indistinct data of space and create map-like mental images f...
متن کاملA Hybrid Optimization Algorithm for Learning Deep Models
Deep learning is one of the subsets of machine learning that is widely used in Artificial Intelligence (AI) field such as natural language processing and machine vision. The learning algorithms require optimization in multiple aspects. Generally, model-based inferences need to solve an optimized problem. In deep learning, the most important problem that can be solved by optimization is neural n...
متن کاملShoslif-o: Shoslif for Object Recognition and Image Retrieval (phase Ii)
We describe a self-organizing framework for content-based retrieval of images from large image databases. The SHOSLIF (Self-organizing Hierarchical Optimal Subspace Learning and Inference Framework) system uses the theories of optimal linear projection for automatic optimal feature selection and a hierarchical image database organization to achieve a logarithmic retrieval complexity. We use inc...
متن کامل